A Meta Classifier by Clustering of Classifiers
نویسندگان
چکیده
To learn any problem, many classifiers have been introduced so far. Each of these classifiers has many strengths (positive aspects) and weaknesses (negative aspects) that make it suitable for some specific problems. But there is no powerful solution to indicate which classifier is the best classifier (or at least a good one) for a special problem. Fortunately the ensemble learning provides us with a powerful approach to prepare a near-to-optimum classifying system for any given problem. How to create a suitable ensemble of base classifiers is the most challenging problem in classifier ensemble. An ensemble vitally needs diversity. It means that if a pool of classifiers wants to be successful as an ensemble, they must be diverse enough to cover the errors of each other. So during creation of an ensemble, we need a mechanism to guarantee the ensemble classifiers are diversity. Sometimes this mechanism is to select/remove a subset of the produced base classifiers with the aim of maintaining the diversity among the ensemble. This paper proposes an innovative ensemble creation named the Classifier Selection Based on Clustering (CSBC). The CSBC guarantees the necessary diversity among ensemble classifiers, using the clustering of classifiers technique. It uses bagging as generator of the base classifiers. After producing a large number of the base classifiers, CSBC partitions them using a clustering algorithm. After that by selecting one classifier from each cluster, CSBC produces the final ensemble. The weighted majority vote method is taken as aggregator function of the ensemble. Here it is probed how the cluster number affects the performance of the CSBC method and how we can choose a good approximate value for cluster number in any dataset adaptively. We expand our studies on a large number of real datasets of UCI repository to reach a decisive conclusion.
منابع مشابه
Classifier Ensemble Framework: a Diversity Based Approach
Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition,...
متن کاملImproving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...
متن کاملA Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows
One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...
متن کاملFault Detection of Bearings Using a Rule-based Classifier Ensemble and Genetic Algorithm
This paper proposes a reduct construction method based on discernibility matrix simplification. The method works with genetic algorithm. To identify potential problems and prevent complete failure of bearings, a new method based on rule-based classifier ensemble is presented. Genetic algorithm is used for feature reduction. The generated rules of the reducts are used to build the candidate base...
متن کاملCombining Classifier Guided by Semi-Supervision
The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...
متن کامل